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ABSTRACT 

Massive galaxies at high-z have smaller effective radii than those today, but similar cen- 
tral densities. Their size growth therefore relates primarily to the evolving abundance of 
low-density material. Various models have been proposed to explain this evolution, which 
have different implications for galaxy, star, and BH formation. We compile observations of 
spheroid properties as a function of redshift and use them to test proposed models. Evolution 
in progenitor gas-richness with redshift gives rise to initial formation of smaller spheroids at 
high-z. These systems can then evolve in apparent or physical size via several channels: (1) 
equal-density 'dry' mergers, (2) later major or minor 'dry' mergers with less-dense galaxies, 
(3) adiabatic expansion, (4) evolution in stellar populations & mass-to-light-ratio gradients, 
(5) age-dependent bias in stellar mass estimators, (6) observational fitting/selection effects. 
If any one of these is tuned to explain observed size evolution, they make distinct predic- 
tions for evolution in other galaxy properties. Only model (2) is consistent with observations 
as a dominant effect. It is the only model which allows for an increase in MBn/A^buige with 
redshift. Still, the amount of merging needed is larger than that observed or predicted. We 
therefore compare cosmologically motivated simulations, in which all these effects occur, & 
show they are consistent with all the observational constraints. Effect (2), which builds up an 
extended low-density envelope, does dominate the evolution, but effects 1, 3, 4, & 6 each con- 
tribute 20% to the size evolution (a net factor ~ 2). This naturally also predicts evolution in 
Mbh — o" similar to that observed. 

Key words: galaxies: formation — galaxies: evolution — galaxies: active — quasars: general 
— cosmology: theory 



1 INTRODUCTION 

Observations have suggested that high-redshift spheroids have 
significantly smaller effec tive radii th an low-redshift a nalogues 
of the same mass (e. g. iDaddi etall |2005; van Dok kum et al.l 
20081: IZirm etal] 120071: iTruiillo et alj Fo oeb: Franx et aD '20M 



ulations have been linked to observed sub-millimeter galaxies, the 
most rapidly star-formi ng objects in th e Universe , and bright, high- 
redshift quasar hosts j Younger et alj 2008b; T acconietalj|2008l : 
iHopkins et"al]|2008j |bl; l Alexander et al.il2008i) . As such, these ob- 



Damianov et alj|2009l:lvan der Wei et ^2 008: Cimatti et al. 26o|; 



Truiillo et alj|200l l2006al : IXoft et alj |2007; Buitrago et al. 2008) 



Whatever process explains this apparent evolution must be particu- 
lar to this class of galaxies: disk galaxies do not become similarly 
comp act at high redshift jSomerviUe et al.l l2008l : iBuitrago et al.l 
I2OO8L and references therein)[3 In addition, these high-redshift pop- 
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' There are of course different ways of defining galaxy "density" or 
"compactness," for which the disk/spheroid difference is not the same (see 



e.g. iBuitrago et al ]^2008'). In this paper, we will generally use these terms 
to refer to the coarse-grained phase-space density, which can be approx- 
imated by / ~ M/{R^V^) ~ 1/{GR^V), where R is the characteristic 
major-axis radius (proportional to e.g. effective radius of spheroids or disk 
scale-length), and V the characteristic velocity (dispersion or circular ve- 
locity). This is the quantity of theoretical interest as, in dissipationless pro- 
ce sses, it is conserved o r decreased (/unai < /initial : see e.g. the discussion 
m iHernquist et"ai]| 19931) . Quantities such as the orbital streaming motion 
and disk scale heights enter in the fine-grained phase-space density, which 
although formally conserved is not observable. Similarly, as discussed in 
§[2] we take "size" to refer to the semi-major axis half-light size. 
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servations represent a strong constraint on models of galaxy and 
bulge formation. 

More recently, iHopkins et all j2009ah showed that the cen- 
tral densities of these high-redshift systems are similar to those of 
massive ellipticals at low redshifts; the primary difference between 
"small" (high-redshift) and "large" (low-redshift) systems relates 
to the amount of observed low-density ma terial at la rge radii, ab- 
sent in the high-redshift systems (see also iBezanson et al. 2009V 
There are therefore two important, related questions. First, how do 
high-redshift massive spheroids form, apparently without low den- 
sity material, but with their dense cores more or less in place rela- 
tive to their z = descendants? And second, do these early-forming 
systems "catch up" to later-forming massive counterparts by accu- 
mulating such low-density material? If so, how? 

High-resolution hydrodynamic simulations have shown, for 
example, that the size of a spheroid at the time of its formation 
primarily reflects the degree of dissipation involved ~ i.e. the loss 
of angular mome ntum by disk gas and its participatio n in a dense, 
central starburst jCox et al.ll20b 6b: Onorbe et al. 200^: ICiotti et al.l 
l2007l : Ijesseit et alj|2007l : iHopkins et al.ll2009bllei) . If all the mass 
of a spheroid were formed in such a starburst, then one would ex- 
pect an extremely small size < kpc, comparable to the sizes of e.g. 
ULIRG starburst regions. If, on the other hand, none of the mass 
were formed in this way, the size of the remnant would simply re- 
flect the (large, ~ 5 — 10 kpc) extents of disk/star-forming progeni- 
tors. This leads to a natural expectation for size evolution. 

Disks at z = 2 are observed to be much more gas -rich than 
those of comparable mass today (e.g. lErb et al* '2006*). so their 
merg e rs will naturally lead t o smalle r remnants ( Khochf ar & Silkl 
l200d : IHopkins et alTl2007 j. |2009d). In fact, simulations have 
shown that simply scaling extended, low-density disks in the lo- 
cal Universe to the typical gas fractions of L, disks at z > 2 
is su fficient to produce ~ lO" Mq ellipticals with Re ~ Ikpc 
jHop kins et al. 2008a). Ellipticals that form at lower redshift (most 
of the population), from less gas-rich mergers, will be larger, so 
the mean size-mass relation will evolve. In particular, these sys- 
tems, forming more of their mass from violent relaxation of the 
(low-density) progenitor stellar disks (those being relatively gas- 
poor, yielding little starburst), will have a larger "envelope" of low 
density material at large radii. 

Meanwhile, the compact, early-forming ellipticals will un- 
dergo later "dry" mergers, in particular with later-forming, more 
gas-poor disks and less dense ellipticals (less dense because they 
formed later, from less gas-rich disks, as described above). This 
will preferentially add mass to their low-density profile "wings" 
and increase Re, allowing them to "catch up" to later forming sys- 
tems. Thus not only does such a general scenario anticipate evolu- 
tion in the median size-mass evolution, but also the nature of such 
evolution: early buildup of dense regions via dissipation, followed 
by later growth in the extended, low-density wings as the progeni- 
tor population becomes less gas-rich with cosmic time. 

That being said, cosmological galaxy formation models have 
difficulty explaining how systems could catch up from the most 
extreme size evolution seen in the observations: a factor of ~ 6 
smaller effective radii Re at fixed stellar mass in the most m assive 
galaxies at z = 2 (see iToft et al]|2007l : Isuitrago et alj | 200 8'). The 
models above predict a more moderate factor ~ 3 evolution in the 
most massive systems. In addition, the expected efficiency of merg- 
ers should leave some galaxies un-merged (at least without major 
re-mergers or several minor re -mergers) since their original gas- 
rich, spheroid-forming merger ("Ho pkins et a"i] |2009d). If no other 
effects act on these systems, this surviving fraction would over- 



predi c t the number of su ch compact sy stems today i Truiillo et al.l 
I2OO9I : iTavlor et alj |2009^ (but see also IValentinuzzi et alj l2009t) . 
Moreover, some local observations have argued that, at fixed stel- 
lar mass, ellipticals with older stellar population ages and/or those 
in the most dense (and early-forming) environments may have the 
largest radii for their mass - if so, these systems have evolved 
to "overshoot" the median of the local size-mass re lations, evolv- 
ing by more like a factor ~ 10 in effective ra dius ( iGallazzi et al.l 
l2006l : lBernardi et alllloOTl : [Graves et ai]|2009l) . (Although this de- 
pends on w hether "mass" is defined by dynamical or stellar m ass, 
and we note lvan der Wei et alJl200^ : IValentinuzzi et al]|2009l . who 
reach different conclusions.) As a consequence, other interpreta- 
tions of the observations and theoretical models for size evolution 
have been debated. 

In this paper, we show how dissipation drives the formation 
of smaller ellipticals at high redshift (§|2} and consider the differ- 
ent explanations that have been proposed for how these massive, 
high-redshift ellipticals increase their apparent sizes at lower red- 
shifts, and construct the predictions made by each individual model 
for other observable quantities, including their velocity dispersions, 
central densities, masses, and profile shapes (§[3}. We compile ob- 
servations of these quantities and other constraints to break the de- 
generacies between the different models (§|4}. We then compare a 
model motivated by cosmological simulations, in which many of 
these effects occur at different points in the galaxy's evolution. We 
show how such a mixed model tracks through the predicted space, 
and how relatively small contributions from each of the proposed 
explanations combine to yield order-of-magnitude cumulative size 
evolution (§[5j. We summarize and disc uss our conclusions in §|7] 

Throughout, we assume a WMAP5 ( Komat su et al.l200^ cos- 
mology, and a Chabrier (.2003, ) stellar IMF, but the exact choices 
make no significant difference to our conclusions. 



2 WHY DOES THE SIZE-MASS RELATION EVOLVE IN 
THE FIRST PLACE? 

First, we must consider how small, high-mass ellipticals are formed 
initially. Th is is disc ussed in detail in e. g.lKh ochfar & Silk (200g); 
iNaab et alj ( |2009|) : iFeldmann et"^ ( |2009|) and IHopkins et al] 
( l2009d) : but we briefly review these results here. 

Figure [T] summarizes the important physics. First, consider 
the mass density in passive ellipticals as a function of redshift 
{top; from Bell et al. 2003; Bundy et al. 2005, 2006; Abraham et^ 
2007; Daddietal. 2005: Labbe et alj |2005| ; Ivan Dokkum et al . 
i2006. ; .Grazian et al..2007,) Pf It is well-established that this declines 



rapidly with redshift; we fit the observations shown in Figure[T]with 
the simple functional form pcii oc exp (— Az) and find A ~ 1.2—1.6. 
In other words, at z = 2, only ~ 5% of the z = 0, ~ L* ellip- 
tical mass density is in place. It is possible to consider this in 
greater detail in terms of number counts or as a function of galaxy 
mass, but the qualitative results are similar in each case: the pop- 
ulation "in place", represented by the compact, high-redshift mas- 

^ Specifically, we plot the mass density in bulge-dominated galaxies, which 
is not the same as the absolute mass density in all bulges. At high redshifts 
z > 1.5 observed morphologies are ambiguous; we show the mass density 
in passively evolving red galaxies as a proxy. This may not be appropriate, 
but at z < 1 the two correspond well, and the compactness, size, and kine- 
matics of the "passive" objects do appear distinct from star-forming ones 
iKriek et al. 200^; iXoft et alj|2007l : IXruiillo et al.ll2007l : iFranx et al.lHooi : 
lOenzel et al..2008ft . 
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Figure 1. Top: Evolution in the mass density in spheroid-dominated (com- 
pact) galaxies. Points are observations; dashed lines a fit. Evolution is steep; 
at all z, most spheroids are recently-formed and have not experienced e.g. 
dry mergers. The size-mass relation must primarily reflect how spheroids 
form in situ. Middle: Evolution in typical gas fractions of star-forming 
(spheroid progenitor) galaxies of total (baryonic) mass such that their major 
merger will yield a > lO" Mq spheroid. Points observed; error bars show 
the scatter in /gas at fixed mass, not the (smaller) uncertainty in the mean. 
Line shows a fit. Bottom: Predicted sizes of ~ 10"Mq ellipticals formed 
in situ at each redshift from gas-rich mergers w ith the expected (/gas jz) 
Circles show observed (mean) sizes compiled in lvan der Wei et aU j200a 
Black diamonds show actual hydrodynamic simulation renmants with the 
same progenitor disk size/structure but with different gas fractions appro- 
priate for each (/gas (z) ) . Black solid line shows a fit, given the median scal- 
ings of (/gas(z)) and Re(fw)- Dotted line is the same, allowing for a 1 Gyr 
delay after each merger before it is observed as a "passive" remnant. Red 
diamonds are similar simulations, but also include viewing biases and stel- 
lar population effects (e.g. mock images at the appropriate times) and allow 
for the maximum observationally inferred (weak) progenitor disk size evo- 
lution (i?disk oc (1 +2)"^""^). Red dashed line shows the appropriate fit in- 
cluding this additional scaling. Evolving gas-richness and rapid new buildup 
of ellipticals drives the evolution in the size-mass relation. Dry mergers and 
other effects do not dominate the relation, but explain how early-forming 
systems "catch up to" (or exceed) the relation at low redshifts. 



sive elliptical population, is only a small fraction of the popula- 
tion that will be present at any significantly lower redsh ift (e.g 
van Dokkum et al. 2008; Rriek et al. 2008a; Perez-Gonzalez et al 



2008; Mar chesini et al.ll2008l ; Ivan der Wei et al.ll2009l ; lllbert et al 



2009.) FI As such, evolution in the median size-mass relation - i.e. 
the average size of galaxies at a given stellar mass - must reflect 
evolution in the sizes at the time of formation. 

This is very important; at any redshift, most of the spheroid 
population is recently formed, and has not had to evolve from some 
earlier redshift via e.g. dry mergers or any other channel. The evo- 
lution in the size-mass relation cannot, therefore, be the result of 
all ellipticals forming early (with some size) and then dry merging 
or experiencing other processes that increase their size to z = 0. 
Rather, at each redshift, the size of ellipticals forming at that time 
must be larger than the size at the time of formation of ellipticals 
that formed earlier. These "newly formed" spheroids, which com- 
prise most of the population, will dominate the size-mass relation 
at that redshift, independent of how the earlier-forming population 
evolves (which constitute part of the scatter, but do not dominate 
the median relation). 

What, therefore, determines the sizes of ellipticals at forma- 
tion? At both low and high redshifts, star-forming disk and/or ro- 
tationally supported galaxies have much larger sizes than ellipti- 



cals of similar mass (see e.g lKonnendvlll985l; IShen et alj|2003l; 
iTruiillo et al]|2004l; iToft et aLllToOTl ; iBuitrago et alj|2008l) More- 



over the slope of the disk/rotationally supported galaxy size-mass 
relation is distinct from that of the spheroid size-mass relation at all 
z = — 3 (references above). And, in addition, observations have 
shown that disk sizes do not appear to evo lve wi th redshift nearly as 



|2004l; iFereuson et al. 


,2004; Harden etal. 2005; Toft et al. 


,20071; 


Akivama et alj|2008l; 


Buitraeo et al.l2008l;ISomerville et al. 


2008h. 



Parameterizing size evolution with redshift (at fixed mass) as a 
power-law Rc{M, \ z) oc (1 -\-z)~^ , the observations constrain the 
maximum [3 for disk galaxies to be /? < 0.6 at intermediate masses 
and P < 0.8 at the highest masses (with many of the observations 
still consistent with /3 ~ for disks). Massive ellipticals, on the 
other hand, appear to evolve with /3 ~ 1.4 — 1.7. It is clear, there- 
fore, that the sizes of ellipticals at formation, and the evolution in 
their size-mass relation, cannot simply reflect the sizes of their pro- 
genitors. 

Simple phase-space considerations, however, make it impos- 
sible to increase densities in dissipationless (purely stellar) merg- 
ers (Hernquist et al. 1993). But in sufficiently gas-rich mergers, 
the remnant size can be much smaller than that of the progen- 
itor. Gas dissipation makes this possible; di sk gas loses angular 
mom e ntum via internal torques in the merger jBames & Hernguistl 
Il99ll Il992[) . and can then dissipate energy, fall to the center, 
and build stars in a compact ce ntral starburst on scales <C kpc 
jMihos & HemauistllT994blll996l) . High-resolution hydrodynamic 
simulations, and basic physical arguments, have shown that this 



Although there is some debate regarding the degree of evolution in 
number density of the very most massive galaxies at z < 0.8, the ~ 
L*, M, ~ 10" Mq population on which we focus here (and which de- 
fines the observed samples to which we compare) dominates the ef- 
fects shown in Figure [T] and clearly shows a rapid decrease with red- 
shift eyen_ove£_thisra^ Bundv et alj 
2005; Pannella et al. 2006; Frances chini et al.l 120061; iBorch et ali l2o6a 
Brown et al. 2007; Pozzetti et al. 2009^!^ Moreover, the rapid decline in pas- 
sive spheroid number density with redshift is clear at all masses at z > 1.5. 
the range of particular interest here. 
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degree of dissipation is the primary determinant of the remnant 
r"^ B H I n 1 ( 1 

spheroid size (Cox et al. 2006b; Robertson et al. 2006a; Naab et al. 

"2006a; Onorbeetah 2006; Ciotti et al. 2007; Jesseit et al. 2007, 
20081 ; Icovington et al. j I2OO8I ; iHopkins et alj l2009bl leh. To lowest 



order, this degree of dissipation - i.e. the mass fraction formed 
dissipationally - simply reflects the cold gas frac tions available 
in the progenitor disks at the time of the merger (Hop kins et al.l 
l2009ch . To rough approximation, one can fit the results of high- 
resolution simulations of gas-rich mergers (references above) and 
estimate how the remnant spheroid size scales with this gas frac- 
tion: l/g,,) ^ R,{M,\f^.,, = 0) exp(-/g,,/0.3). (Based on 
the arguments above, Re{M, \ /gas = 0) is equivalent, modulo a ge- 
ometric prefactor, to the pre-merger disk effective radii.) If the gas 
mass is sufficiently large (~ 1/2 the galaxy mass), then the rem- 
nant size (half-mass radius) will simply reflect the sub-kpc scales 
of the starburst region; if it is sufficiently small, it has no effect and 
the remnant sizes reflect those of disk progenitors. 

The natural expectation, of course, is that the gas frac- 
tions of higher-redshift star-forming galaxies will be larger than 
those of low-redshift systems of the same mass; this has now 
been seen in a number of di rect observation s . Figu r e [T] (mi<jrf/e) 
compi les o bservations from iBell & de Jon3 ( l200lh, iKannappaij 
' 20041) . and iMcGaugi] (l2005h at low re d shift, a nd Ishaplev et al ' 



ana iMciiaugH (izuupj) at tow re d smtt, a nd isnaptev et al 
lErbetalJ ( l2006h ; IPuech et alj (|2008|); iMmnucci et al 



("2009) a t redshifts z ~ 1 - 3 (see also 'Calura et aL| |2008| ; 
[porste r Schreiber et alj|2009l ; lErbl|2008)F| Specifically, we plot the 
gas fractions observationally inferred for disk/rotationally domi- 
nated, star-forming galaxies of baryonic masses ~ 0.5 X 10" Mq 
(the stellar masses may be a factor of a couple lower, correspond- 
ing to the observed /gas) such that after a major merger (which 
will increase the total mass and turn the gas into stars), the sys- 
tem will be comparable to the ~ lO" Mq observed massive, com- 
pact ellipticals. Figure[T]shows that /gas grows from ~ 0. 15 — 0.20 
at z = to ~ 0.4 — 0.5 at z = 2.5 — 3 (approximately as (/gas) = 
0.16(1+2)°'*'). 

This leads to an expected evolution in the sizes of spheroids 
at their time of formation. A detailed set of predictions for 
this size evolution as a result of evolving gas fractions is pre- 
sented in H opkins et al. (2007a, 2009d) and a similar model in 
iKhochfar & Silk! ((2006); in Figure [T] we simply summarize the 
key result. Figure [T](fco/toOT) plots the expected size of major gas- 
rich ~ 10" Mq merger remnants, considering the mean gas frac- 
tions as a function of redshift (shown above). We compare these 
with the observed average sizes of spheroids of the given mass 
at each redshi f t. We show the results of simulated remnants from 
iHopkins et al. I l l2009h) . with the same initial disk sizes appropri- 
ate for this mass, but with the relevant /gas for the median at each 
redshift. We also show the corresponding median trend estimated 
by simply combining the fitted i?e(M* | /gas) scaling above with the 



At z = 0, the gas fractio ns sh own are based on measured atomic HI gas 
fractions jBell & de Jong ( 2001) conect this to include both He and molec- 



ular H7;|McGaugh (2005) correct for He but not Ht ; iKannappai] j2004l 
gives just the atomic HI gas fractions (this leads to slightly lower esti- 
mates, but still within the range of uncertainty plotted; H2 may account for 
^ 20 — 30% of the dynamical mass, per the measurements in ljogee et alj 
l2005h . We emphasize that these gas fractions are lower limits (based on 
observed HI flux in some annulus). At z = 2, direct m easurements are not 
always available; the gas masses from Erb et al are estimated indi- 

rectly based on the observed surface densities of star formation and assum- 
ing that the z = Kennicu tt law holds; other indirect estimates yield similar 
results iCresci et alj2009l) . 



{/gas(z)) scaling. In these cases the disks are all just as extended 
as low-redshift disks (i.e. yield i?<.(10" Mq | /gas = 0) ^ 6-7kpc, 
with no evolution in disk size with redshift). Already, this appears 
sufficient to explain at least the high end of the observed evolution. 
We could also consider the trend if we allow for a delay between 
merger and observation (because the observed systems are pas- 
sive, one might preferentially select systems that had their gas-rich 
merger ~Gyr ago; and so had correspondingly somewhat higher 
/gas reflecting the expectation at that earlier time); this gives similar 
but slightly stronger evolution. We can also allow for some mod- 
erate disk size evolution. If we scale the sizes of progenitor disks 
(and, as a consequence, the remnant Re in the absence of gas) by 
the maximum allowed by the observations, cx (I +2)""'', we again 
find similar but slightly stronger evolution. These simple size pre- 
dictions agree very well with the observed spheroid sizes at all in- 
termediate and high redshifts. 

In other words, it is straightforward to form ~ 1 kpc-sized 
~ 10" Mq ellipticals at z = 2, and in fact such sizes are the 
natural expectation given the observed/expected gas-richness of 
spheroid-forming mergers at these redshifts. The difficulty is not 
"how to form" such ellipticals, nor is it even to explain the av- 
erage evolution in the size-mass relation. Rather, the difficulty is 
that, as discussed in § [T] such systems clearly do not passively 
evolve to z = (where they would constitute a small but eas- 
ily detectable fraction of the local spheroid population); in fact 
there does not even appear to be evidence for a small fraction of 
such systems retaining their small sizes to z = 0. It may even be 
the case that early forming systems would to be the largest for 
their mass (i.e. h ave lower effective densities) at z = (see e.g. 
[Graves et al.ll200 9. and references in § [TJ. Although this remains 
observationally unclear, it is generally agreed that the most highly- 
clustered systems, massive BGGs and BCGs (the expected de- 
scendants of the most massive, first-to-assemble high-redshift sys- 
tems) appear to lie significantly above the Shen et al. (2003_) size- 
mass relation for more recently-assembled field galaxies (see e.g. 



. gga 

iBatcheldo r et alj2007 ; von der Linden et aI.l2007l ; lKormendv et al.l 
2009; Lau er et al.ll2007d ; lBemardi et al.ll2007[l 

Some process, therefore, not only increases the sizes of these 
high-redshift early-forming systems so as to "keep pace" with the 
mean evolution in the size-mass relation, but may even need to 
"overshoot" the relation. Some hint of this can be seen even in Fig- 
ure [TJ at the lowest redshifts, assuming all ellipticals are formed 
in situ from gas-rich mergers actually K«rfer-predicts the median 
low-redshift sizes. By this late time (unlike at high redshifts) a non- 
trivial fraction of the population has formed earlier and undergone 
some subsequent evolution, bringing up the average size at these 
masses. In what follows, therefore, we consider how such systems 
might grow in size at a pace equal to or greater than the rate of 
change in the size of newly-forming systems shown in Figure[T] 

Note that the sizes shown in Figure[T] and the simulation sizes 
to which we refer throughout this paper (and, where possible, the 
observed sizes) refer to the semi-major axis lengths, for elliptical 
systems. This is almost identical to the (projected) circular radius 
R that encloses 1/2 the light (on average, the two are the same, 
in simulations). However, this is not identical to the "circularized" 
radius R^m = Vab, where a and b are the major and minor axis 
lengths {a « R^), respectively (e = l — b/a being the standard def- 
inition of ellipticity). The reason for our choice is that the major 
axis length or projected half-light circular radius is more physi- 
cally robust, and relevant to the constraints from phase space den- 
sities and merger histories. A thin disk, for example, viewed edge- 
on, has a vanishingly small circularized radius (arbitrarily small b). 
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even though the parameter of physical interest, the scale length, 
is the same. Examination of the suite of simulations shown here, 
for example, shows that while at low ellipticity systems can be ei- 
ther large or small, there are no systems with very large ellipticity 
and large Rckc, even though such systems are merely flattened rel- 
ative to their counterparts, and have similar scale lengths, velocity 
dispersions, energ etics, and physical phase space densities. Follow- 
ing Dehnei]fl99a) . one can show analytically that dissipationlessly 
randomizing a system which is flattened owing to some rotation, 
conserving total energy and phase space density, leads to very little 
change in the projected major-axis radius (this is the basic reason 
why the projected Re of dissipationless disk-disk merger remnants 
is similar to the in-plane — 1.65 A of their progenitor disks, dis- 
cussed above), but will obviously increase ifcirc by an arbitrarily 
large factor depending on the thickness of the original system. Be- 
cause more gas-rich mergers lead to remnants with more rotation 
(on average), the ellipticity of the systems in Figure[T]can be some- 
what higher at high redshift, and therefore their circularized radii 
evolve even more steeply (averaging over all inclinations, though, 
the effect is weak, adding a power ^ (1 +2)"° ^' to the redshift 
evolution). 



3 THE MODELS AND THEIR PREDICTIONS 

We consider the following sources of size evolution for systems 
initially formed at some high redshift (z ~ 2) as compact, massive 
galaxies (Re ~ 1 kpc, M, ~ lO" Mq), illustrated in Figure|2] 

"Identical" Dry Mergers: Often, when the term "dry 
mergers" is used in the context of models for size evolution, 
what is actually assumed is not general gas-poor merging but 
specifically spheroid-spheroid re-mergers, between spheroids with 
otherwise identical properties (or at least identical profile shapes 
and effective densities, in the case of non-equal mass mergers). 
In a 1:1 such merger, energetic arguments a s well as simulations 
jHemguist et alj[l993l : iHopkins et al]|2009eh imply profile shape 
and velocity dispersion are conserved, while mass and size double 
(more generally. Re oc M). 

Minor/Late Accretion: In fact, the scenario above is not 
expected to be a primary growth channel for massive ellipticals, 
nor is it expected to be the most common form of "dry merger." 
Rather, in models, at later times the typical secondary (even in 
major mergers) is a later-forming galaxy - a gas-poor disk or 
more "puffy" spheroid that was itself fo rmed from more gas- 
poor mergers and therefore less compa ct jHopkins et al. 1 12009 j ; 
iNaab et alj |2009| : iFeldmann et al.l |2009|) . At the highest masses, 
growth preferentially becomes dominated by m ore and more minor 
mergers with low-ef fective density galaxies jMaller etal]|2006l : 
iHopkins et al.ll20093) . The secondaries, being lower-density, build 
up extended "wings" around the high central density peak in 
the elliptical. The central density and velocity dispersion remain 
nearly constant, while the Sersic index o f the galaxy increases 
with the buildup of these wings (see e.g. iNaab & Truiillolbood : 
IHopkins et al .I2009el l3). The effective radius grows much faster per 
unit mass added in these mergers - it is possible to increase Re of a 
given elliptical by a factor ~ 6 per mass doubling (Re oc ''). 

Adiabatic Expansion: If a system loses mass from its central 
regions in an adiabatic manner, the generic response of stars and 
dark matter will be to "puff up," as the central potential is less deep. 



For a spherically symmetric, homologous contraction of shells with 
circular orbits, this reduces to the criterion that M{r) r — constant. 
More general s cenarios behave in a similar man ner (modulo small 
corrections; see lZhadl2002l : lGnedin et aT]|2004) . If a galaxy could 
lose a large fraction of its central (baryon-dominated) mass, either 
by efficiently expelling the material from stellar mass loss or by 
blowing out a large fraction of the baryonic ma ss in an initiall y 
very gas-rich system (from e.g. quasar feedback; iFan et al]|2008h . 
the radius will grow correspondingly (Re oc M~'). The profile 
shape will be conserved (to lowest order, although this depends 
at second order on the stellar age distribution versus radius) but 
uniformly inflated, the central density will decrease sharply, and 
the velocity dispersion will decrease oc R~ ' . 

Mt^/L Gradients: Massive, old ellipticals at z = hav e weak 
color gradients (McDermid et al. 2006; Sanchez-Blazqu ez et al.l 
I2OO7I and references therein). As such, the effective radius in 
certain optical bands is generally a good proxy for the stellar mass 
Re- However, the same is not necessarily true for young ellipticals 
recently formed in mergers; these can have blue cores at their 
cent ers with young stars just fo rmed in the merger-driven starburst 
(e.g. iRothberg & Joseph! l2004h . Simulations and resolved stellar 
population analysis suggests that the resulting gradient in M»/L 
(brighter towards the center) can lead to smaller Re by up to a factor 
~ 2 in optical bands (e.g . rest-frame S), relative to the stellar mass 
Re dHopkins et al ]|2008ci) . As the system ages, these gradients will 
vanish and Re in optical bands will appear to increase; moreover, 
depending on the exact band observed, the late- time iJf.ught may 
actually over-estimate the stellar mass Re (as e.g. age gradients 
fade and long-lived metallicity gradients remain, yielding a 
redder center and hence apparently less concentrated optical light 
distribution). Thus at both early and late times, M, /L gradients 
can, in principle, yield evolution in the size at fixed wavelength, 
while conserving the stellar mass Re. Obviously, the central mass 
density will remain constant, and the velocity dispersion will only 
weakly shift (with the appropriate luminosity- weighting). 

Seeing/Observational Effects: The large effective radii of 
massive, low-redshift ellipticals are driven by material in low sur- 
face density "envelopes" at large radii. This is difficult to recover at 
high redshifts. More over, galaxy surface brightness profiles are not 
perfect [Sersid ( 1 19681) profiles, so the best-fit Sersic profile and cor- 
responding Re will depend on the dynamic range observed (see e.g. 
Boyla n-Kolchin e t al. 2005; Hopkins et al. 2O09a)- Together, these 
effects can, in principle, lead to a wiaWsv fitted Re (from sampling 
only the central regions, inferring a smaller and less low-density 
material) at high redshifts (although it is by no means clear that the 
biases must go in this direction). At lower redshifts, where surface 
brightness limits are less severe, more such material would be re- 
covered, leading to apparent size-mass and profile shape evolution 
(Re changes at fixed M»), without central surface density or velocity 
dispersion evolution. 

Another effect that might occur is that high-redshift ellipticals 
could be more flattened than those at low redshift. If the defini- 
tion of effective radius used is that of the "circularized" radius 
/?circ ~ Vcib, then a flattened high-redshift system could have small 
^ciic from edge-on sightlines (and smaller median ^circ, by a lesser 
factor), per the discussion above in §[2] Indeed, many such compact 
systems are observed to b e relatively elliptical dValentinuzzi et al] 
I2OO9I ; Ivan Dokkum et alj |2008). This could in fact be a real 
physical effect - high redshi ft systems mig ht have more rotation 
or larger anisotropy (see e.g. Ivan der Wei & van der MareL.2008i) . 
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Figure 2. Evolution of physical {top) or observationally inferred {bottom) surface stellar mass density profiles, according to different models. We consider six 
models, described in the text: the physical stellar mass profile can change owing to identical dry mergers (doubling M, and Re\ top left), minor/late accretion 
(building up an extended envelope; top center), or adiabatic expansion (mass loss leading to uniform inflation; top right). The inferred mass profile (fitted from 
observations, assuming standard stellar populations and constant M, / L) can change owing to the presence of stellar mass-to-light ratio gradients (from young 
cenfi'al stellar populations; bottom left), seeing effects and surface brightness dimming (points show a simulated z = 2 profile with typical seeing and surface 
brightness depth, solid line is the best-fit r'/^-law profile given the observed range; bottom center), or discrepancies between the true and best-fit stellar mass 
(owing to e.g. contribution of AGB stars; bottom right). In each case, the plotted initial profile is a factor of ~ 2 — 3 "too small" relative to the z = size-mass 
relation. Each model is tuned so that, after evolution according to the model, the same final profile (in this case that matching the observed profile of a typical 
massive core elliptical on the local size-mass relation, NGC 4365) is recovered. Properties of the initial galaxies and final remnant (same in all cases) are given; 
aiTows show effective radii. 



Some process, e.g. dynamical heating from bars, or minor mergers, 
or clumpy star formation, could then vertically heat the systems 
by scattering stars and make them more round, while contributing 
little net mass or energy. Major axes would be little affected, 
while circular radii would increase. This effect could also occur 
owing to selection effects; if high-redshift samples (selected via a 
combination of Sersic indicex estimates and/or stellar population 
properties) include diskier systems or more early-type disks (e.g. 
Sa galaxies). In either case, we include it as part of this category 
because the systems would appear to evolve along similar tracks, 
as both the apparent and real evolution in i?circ would involve no 
significant change in a, M*, or Ec. 

Stellar Mass Uncertainties: If the (uncertain) contribution 
of AGB stars to near-infrared light is large i n young elliptical s 
(ages < 2Gyr, similar to the ellipticals at z ~ 2: lKriek et al.ll200^ . 
then the stellar mass M, as derived from commonly used stellar 
population models lacking a proper treatment of the TP- AGB 
phase (e.g. iBruzual & Chariot! '20031) may be over-estimated by 
factors ~a few (Maraston 2005). The difference will vanish as the 
populations age. This change in inferred M, will lead to apparent 
Re — Mt evolution (M« changes as fixed Re)', systems will appear 



less massive, but conserve Re, a, and profile shape. Likewise, 
redshift evolution in the stellar initia l mass function, suggested 
(indirectly) by some observations jHopkins & Beacom |2006| ; 
Ivan Dokkum 2008; Dave 2008), could yield a similar effect. 

Figure [3] plots how individual galaxies (or, since the history 
of an individual galaxy will be noisy, the median of a population 
of similar galaxies) evolve forward in time, according to these dif- 
ferent models. We assume that at z = 2, all systems "begin" on 
an observed Re — M« relation similar to that inferred for observed 
systems at high mass - specifically at an observed lO" Mq with 
Re = \ kpc. They are evolved such that, at z = 0, they will lie on the 
observed size-mass relation from lShen et al. I ( I2OO3I) . In each case, 
we assume that one and only one of the effects above operates, and 
we consider the strength of the effect to be arbitrary - we make it 
as strong as necessary to evolve the systems onto the z = rela- 
tion. Given this amount of evolution, though, we can quantify by 
how much e.g. the stellar mass must change. Likewise, we show 
how the velocity dispersions, the central/peak stellar mass density, 
and the best-fit Sersic index will change with time as the systems 
evolve towards the z = i?e — M, relation along these tracks. 

This is not directly comparable to what is done observation- 
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Figure 3. Evolution of a fixed population of galaxies, from fixed initial conditions, according to different models. Top Left: Tracks made by galaxies in the 
size-mass plane. Solid line is the adopted "initial" z = 2 size-mass relation - galaxies begin at lO" Mq with i?e ~ 1 kpc, and evolve until they lie on the z. = 
size mass relation (dashed line). We consider six models that could lead to apparent or physical size/mass evolution, described in the text. In the following 
panels, we see how this requires other changes in galaxy properties according to the model. Top Center: Corresponding evolution in effective radius (relative 
to the initial 7, = 2 value) for each model, for the fixed population of galaxies as a function of redshift. The rate of evolution is such that they lie on the observed 
size-mass relation at all times. Top Right: Corresponding evolution in stellar mass, required to produce the given size evolution. Bottom Left: Corresponding 
evolution in the velocity dispersion a. Bottom Center: Evolution in the central/maximum surface stellar mass (not luminosity) density (i.e. some M» kpc~^ 
averaged inside e.g. ~ lOOpc, or containing 1% of the light; not the effective surface brightness). Bottom Right: Evolution in apparent galaxy profile shape 
in e.g. fixed rest-frame B-band, parameterized with the best-fit Sersic index nj. The different models clearly map out different average tracks in this space. 



ally, as the systems "end up" at different stellar masses. However, 
knowing how a galaxy population evolves forward in each model, 
it is straightforward to calculate how properties should evolve at 
fixed stellar mass, looking back on populations with the same ob- 
served M, at different redshifts. We show this in Figure|4] In detail, 
we force the z = galaxies to lie on all observed relations between 
structural properties and mass, and to match the observationally in- 
ferred evolution of Re (M, ) for ~ lO" Mq galaxies, where the most 
dramatic size evolution has been seen. Integrating all populations 



^ Specifically, we adopt the observed z = relations: effective radius 
Re = 3.7kpc(M,/lO"M0)O-5'' iShenetal. 2003), veloci ty dispersion 
a = 190kms-' (M, /lO" Mq)" ^'* tHvde & Bemardill20O9[) . central stel- 
lar mass suiface density = 0.65 X lO" MfT, kpc" ^ (M*/10" Mq)'"" 
(fitted from the compilation in lHopkins et al ]|2009d) , and Sersic index (fit- 
ting the entire galaxy profile to a single Sersic index; as noted above this 
should be treated with some caution as the results depe nd on the fitted 
dynamic range) = 4.5 (M,/10" Mq)""* iFerrarese et^ l200d) . The 
central surface density Sc must be defined: one can adopt either the ex- 
trapolation of the best-fit Sersic profile to r = 0, or the average surface 
density within some small annulus (e.g. fixed physical ~ lOOpc or fixed 
fractional ~ Re/ 50); t he details are discussed in lHopkins et al.l l2009al) and 
iHopkins et all j2009j) . but the differences are small for our purposes (E is 
only a very weak function of R in massive systems at these radii). For con- 
venience we adopt the fixed lOOpc mean definition. We assume the specific 
form for the evolution of the size-mass relation Re (M» \ z) = Re (M» | z = 
0)x(l-|-z)~''^. This is convenient and yields a good fit to the observations 
shown in Figure|4] appropriate for massive galaxies, but the evolution may 
be weaker at lower masses (which has no effect on our conclusions). 



backwards in time, we then reconstruct the properties at fixed stel- 
lar mass (~ 10" Mq) according to each model. 

Clearly, these correlated properties can break degeneracies be- 
tween different models tuned to reproduce the size distribution. For 
example, if adiabatic expansion were the explanation for the ob- 
served evolution, the velocity dispersions of ~ 10"Mq galaxies 
at z ~ 2 — 3 would have to be ~ 2 — 3 times larger than those 
today (because the system needed to have much more mass in- 
side a small radius that was subsequently lost, and the expansion 
is homologous) - i.e. typical a ~ 600km s~' at these masses. On 
the other hand, if minor/late accretion is responsible, the primary 
change has been the buildup of low-surface brightness "wings" in 
the profile, which contribute negligibly to a (leading to dispersions 
~200-250kms"' atz~2-3). 

We compare with data compiled from recent literature. A 
number of measurements of the size distribution at fixed mass, 
hence the median size evo lution, for massiv e (M» ~ 10" Mq) 
galaxies are compiled in van der Wei et al. ('2008'); we adopt 
their compilation (see Trujillo et al. 2006a; Longhetti e t al.l 



van Dokkum et alj 


20081: lFranxetaI.1 |2008|; iRettura et al.1 


20081; Buitrasoetal 


20081). The Sersic indices of the high- 



Ivan der Wei et al.l ( 12008) ; we compare these to the distribution 
of Sersic indices a s a fun c tion of stellar mass at z = presented 



in Hopkin s et al. j2008d. I2009bll3); itself a compilation from 
Rothberg & Josepd il^ki; iKormendv et alj j2009l); iLauei- et al.1 



( I2007ta) ; iFerrarese et alj ( 1200^ . (Although'van der Wei et J]|2008l 
note that a slightly higher Sersic index, similar to the z = obser- 
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Figure 4. Evolution in properties of galaxies at fixed stellar mass, given the different models from Figure|3] Top Left: Size evolution (median size observed at 
redshift z, for galaxies of the same fixed observed stellar mass, relative to the median size observed at z = 0). We adjust each model to give an identical size 
evolution at fixed mass, motivated by the observational fits to the size evolution of high-mass (> lO" Mq) galaxies - i.e. attribute the o bserved size-mass 
relatio n evolution in each case entirely to just the one model. We compare to the observed size evolution (points; from the compilation in Ivan der Wei et al] 
|2008|) . In the following panels, we see the predicted consequences of this for other quantities measured at fixed stellar mass. Top Right: Velocity dispersion. 
Observations are compiled in Cenarro & Truiillo (2009) (squares; the z > 1.2 point comes from stacked spectra), as well as Capoellari et al. (2009) (triangles; 
also stacked spectra). We also show the recent measurement from van Dokkum et al. (2009, star), but note this is a single individual object. Bottom Left: Cen- 
tral/peak stella r mass surface density ( Mj, kpc ~-). Observations a re compiled in Hop kins et al, t 2009a). Bottom Right: Profile shape/Sersic index. Observations 
compiled from lHopkins et al.l j2009bl z = 0). lvan der Wei et alj (2008, z ~ 1). and lvan Dokkum et M j2008l z ~ 2). The different tracks in Figure[3]lead to 
different predicted evolution in these quantities at fixed mass, if the observed size evolution is attributed to each model in turn. 



vations, is also compatible with their sample). The high-redshift 
observations do not resolve the small radii needed to directly 
meas ure the central s t ellar rn ass density E^; we adopt the estimates 
from |Ho pkins et al.l (l2009ah based on the best-fit profiles pre - 
sented in Ivan Dokkum et al. and van d er Wei et al] j2008t) . 

extrapolating the fits from ~ 1 — 2kpc inwards. In low-redshift 
systems this typically represents an upper limit (a factor ~ 1 — 2 
larger than the true central d ensities). The velocity di spersion 
measurements are compiled in ICenarro & Truiilld j2009l) ; at the 
highest redshifts, they are measured therein from stacked spectra. 
Recently, Cappellari et al. (2009) present a similar stacked analysis 
(and a re-analysis of the same objects) from z = 1.4 — 2 and 
z = 1.6 — 2, along with a couple of individual object measurements 
of cr; we show their results as well (from the stacked spectra; the 
individual detections are on the low-cr end of the allowed range 
from the stack). We also show the recent detection presented in 
Ivan Dokkum et id] ( |2009|) . of a very large a ~ 500 km s ' (albeit 
for a more massive M, = 2 x lO" Mq system); however, we note 
that this is a single object (one of the brightest in the field), not a 



statistical sample, and the uncertainties in the measurement of a 
are large. 



4 PROBLEMS WITH EACH INDIVIDUAL MODEL 

None of the models is ideal. Identical dry mergers can easily ex- 
plain core creation in massive ellipticals (owing to the "scouring" 
action of binary supermassive BHs), but move systems relatively 
inefficiently with respect to the Re — M, relation. It requires a very 
large (order-of-magnitude) mass growth to get systems onto the 
z = size mass relation from z = 2; this would yield too many 
~ 10'^M(Ti systems today (relative to e.g. the local mass function 
from Bell et al. 2003 ), given the n umber density of compact z = 2 
systems ( v an Dokkum et al.ll2008h . The predicted evolution in Sr 
is much too strong, and that in a somewhat so. 

Minor/late mergers are a more efficient way to increase effec- 
tive radii relative to the size-mass relation. This model fares best 
in Figure |4] - in fact, it is the only model that appears at least 
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marginally consistent with all the observational constraints. How- 
ever, the implied number of such mergers, to yield the full factor 
~ 6 evolution in Re ( M* ) , is large - larger than that predicted by cos- 
mological mode ls jKhochfar & Silkl l2006l : iHopkins et al]|2009d : 
iNaabetalJlioO^ or permitted by observational constraints imply- 
ing ~ 1 — 2 n iajor dry mergers (typical mass ratio ^1:3) per system 
since z = 2 (van Dokkum 2005; Bell et al. 2006; Lin et al.l f2003 ; 
[Bridge et aLi200 9; Darg et al, 2009a,b). Figure [3] shows a typical 
z ~ 2 compact system will have to increase its mass by a factor 
~ 1.5 — 2 to reside on the z = relation via this mechanism. This 
is not unreasonable, but it is still difficult, especially if in fact the 
high-redshift systems must "overshoot" the z = relation (such that 
older systems have larger Re). This is ultimately reflected in the fact 
that models including just thi s effect tend to predict more moderate 
factor ~ 3 — 4 size evolution ("Khoch far & Silkl2006l ; lHopkins et al.l 
l2009dl) . It may be possible that the high-redshift systems reside 
preferentially in special environments with enhanced merger rates 
(relative to even other halos with the same mass and formation red- 
shift); but we will show below that such an explanation is not nec- 
essary. 

Adiabatic expansion involves mass loss, so there is no issue 
with the mass function. However, the mass loss required to yield 
a large change in Re is correspondingly large, > 50% of the z = 2 
mass. Even if all stars were just formed at this time, it is unlikely 
that stellar evolution could release this much mass (and it must 
be unbound, not simply recycled or he ated). Given obser ved ages 
of the z > 2 systems of ~ 0.5 - 1 Gyr jKriek et alj|2006l) . the ex- 
pected subsequent mass loss is only ~ 20%. One could posit that 
these systems have some other gas reservoir "about to be" blown 
out, but this appears to conflict with their low s tar formation rates 



and directly mea s ured gas properties (see e . g. iDaddi et al 
iKriek et alj|200^ l2008bl ; IWuvts et ahlboOTl ; iTacconi et al 



2005 



2008 . 



and references therein). Moreover, Figure |4] clearly shows that the 
predicted evolution in a and is much larger than that observed. 

Mass-to-light ratio gradients should be present at z = 2, if the 
observed stellar population gradients at low redshift are extrapo- 
lated back in time. However, using these to obtain more than a 
factor ~ 2 in size evolution (the natural maximum in simulations) 
requires extremely low in the central regions (since there 

is effectively an upper limit to M«/L at large radii given by stel- 
lar populations with an age equal to the Hubble time). Such pop- 
ulations would require near-zero age, not the ~ 0.5 — 1 Gyr ages 
observed. It is also likely that dust, at such low age, would can- 
cel out some of these eff ects, as in observed ULIRGs and recent 
merger remnants (see e.g. iTacconi et alj|2003 ; iRothberg & Joseph! 
I2OO4I) . Similar behavior (even frequent red cores in the merger and 
shortly post-merger phase ) are also seen in gas-rich merger simu- 
lations jWuvts et al.l2009l) . Moreover, to the extent that this affects 
the fitted Sersic indices, accounting for the entire size evolution 
would imply higher n, at high-z (since high-Wj profiles have more 
concentrated central light), in conflict with the observations in Fig- 
ure |4] 

Attributing the entire evolution to incorrect stellar mass es- 
timates appears similarly unlikely. It requires invoking some- 
thing more th a n jus t the known differences betw een e.g. the 
iMaraston etall ilOOdt) and iBruzual & CharlotI ( l2003f ) stellar pop- 
ulation models - applied to the observed z = 2 systems with the 
best p resent data, these g ive only factor ~ 1.4 difference in stellar 
mass dWuvts et al.ll2007l) . In any case, the required evolution in a 
clearly disagrees with the observations. 

Observational biases from e.g. profile fitting appear 
marginally consistent with the constraints in Figure |4] However, 



attempts to calibrate such effects typically fi nd they lead to bias in 
high-redshift sizes at the factor ~ 2 level (Bovlan-Kolchin et alj 
[2005; Hopkins et al. 2009a) or smaller (van der Wei et al. 2008^ 
not the factor ~ 6 — 10 desired. Stacking the high-redshift data 
also appears to yield similar sizes, so it is unlikely that a very 
large fraction of the galaxy mass lies at radii not sampled by the 
observations jvan der Wei et al.ll2008h . In fact, experiments with 
hydrodynamic simulations suggest that, if anything, biases in fit- 
ting Sersic profiles may lead to over-estimates of the high-redshift 
sizes (S. Wuyts et al., in preparation), and the higher dissipational 
fractions involved in forming compact ellipticals can yield sharp 
two-component features that bias the fits to higher Sersic indices 
and corresponding effective radii. Moreover, it is difficult to 
invoke these effects to explain the observed Re{M,) evolution at 
lower stellar masses, where z = Sersic indices are relatively low 
so there is less of an effect from the extended tails of the light 
distribution. 

It is also straightforward to check whether or not evolution 
in the shapes of ellipticals, at otherwise fixed major axis radii and 
structural properties, accounts for the observed evolution (via use 
of the circularized radius). Several of the studies discussed here 
present not just the masses and effective radii of their systems, but 
also their ma j or and mi nor axis leng t hs; we c ompile these from 
Trui illo et al] (I2006bll3); iLauer et al.l (l2007bh; Ivan Dokkum et al.l 
l l2008l) ; lDamianov et al.l ( f2009l) ; lKormendv et all ( [200^ . a sample 
from z = — 2.3, and use this to compare the evolution in ma- 
jor and minor axis lengths. Restricting our analysis to major axis 
lengths alone, we find that the evolution is slightly weaker than 
that using circularized radii (in other words, there is some increase 
in the median ellipticity of the samples with redshift); however, 
the effect appears to be relatively small, accounting for a factor 
~ (1 +z)~''''^""'^' in evolution (i.e. ~ 20% of the total size evolu- 
tion). This is comparable to the effects anticipated from simulations 
(see §|2j. 

As observations improve, it appears unlikely that these effects 
can account for the full evolution, although it may be important 
(altogether) at the factor ~ 1.5 — 2 level and cannot be entirely ruled 
out by the constraints in Figure |4] alone. 



5 AN A PRIORI COSMOLOGICAL MODEL 

In principle, all of these effects can occur We therefore consider 
a cosmological model for galaxy growth in which they are all in- 
cluded. We follow a "typical" massive spheroid forme d at z ~ 3 
from the cosmological model in lHopkinsetai] ( l2009dh . using the 
high-re s olution hydrodyn a mic simulations presen t ed in Cox et al] 
l l2006bl) ; [Robertson etall l l2006bl) ; iHopkins et al] ( l2008d) . Tho"se 
simulations include self-consistent models for star formation and 
black hole growth, and feedback from bot h, that enable the sta- 
ble evolution of even very gas rich systems dSpringel et ai]|2005bl; 
Springel & Hernquist 2005). They survey a wide range of param- 
eter space, in both gas-richness, progenitor redshift, and structural 
properties of the merging galaxies, that make them well-suited for 
the experiment below. We illustrate the evolution of the system in 
Figure |5] 

The system first becomes a spheroid in a z = 3 merger of 
two ~ L« disks, with typical gas fractions for their mass and red- 
shift. We specifically cho ose as representative o ne of the simula- 
tions described in detail in IHopkins et al. I( l2008al) (tvpicalof merg- 
ers with such gas fractions, with common orbital parameters and 
halo properties). The simulation "begins" at z = 3 (and completes 
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Figure S. Illustration of a typical history from cosmological simulations, realized in high-resolution hydrodynamic simulations, for a massive, early-forming 
spheroid. Top: Image of stellar surface density (left), and axially-averaged projected stellar surface mass density profile (center, solid) after the spheroid first 
forms in a z ~ 3 gas-iich (/gas ~ 30 — 40%) merger. Physical properties as seen at z = 2 are shown. We also show the profile decomposed into the dissipational 
(post-starburst) stars (with mass fraction ~ 25%) and dissipationiess (pre-merger disk) stars. We construct the rest-frame B-band profile (right) with half-light 
radius Re (Lb), and a mock z = 2 observation (with mock resolution, PSF, and surface b rightness limits) with subs equent best-fit Sersic function effective radius 
i?f.(fit) and index tis. We compare to the observed z>2 compact spheroid profiles from lvan Dokkum et alj l2008h . over the dynamic range outside the PSF and 
above the background limits. Second from Top: Same, after adiabatic expansion and stellar evolution are allowed to operate. Stars lose appropriate mass for 
each stars age/metallicity evolved forward to z = 0, which then virializes; the resulting mass profile is compared to the original (z = 2) profile (center). We also 
re-construct the B-band profile, with the stellar populations aged in this way (allowing for M, / L evolution per the observed/simulated properties), and re-fit it 
assuming image depth comparable to local observations. Second from Bottom: Same, including a single (mass ratio 1:3) "identical" dry merger (high-redshift 
dry merger with similarly gas-iich disk or compact elHptical) in the history. Bottom: Further adding a small series of later major and minor mergers of less- 
dense systems (typical lower-redshift material in disks and spheroids). Since this material has lower dissipational content (forming from lower-redshift, more 
gas-poor disks), the net diss ipational fraction goes down, and extended envelopes preferentially build up. We compare the B-band profile to a few observed 
massive spheroids in Virgo iKormendv et alj2009l) . 
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Figure 6. Predictions of a cosmological model wliere all the effects in the text contribute to size evolution (as in Figure|5). In each, we consider the prediction 
for the net evolution (black solid line) as in Figure|4] We show the contribution to evolution in each track from each of the individual effects described in the 
text. The "late/minor merger" channel dominates growth, yielding a factor ~ 3 — 4 size evolution with small (factor ~ 2) mass growth, but each of the other 
effects here contributes a small ^ 20% additional effect. Together, this gives an additional factor ~ 2 — 3 needed to explain the observed trends with redshift 
given cosmologically realistic merger/growth rates and without violating constraints from stellar mass functions or other observations. 



at about z ^ 2.4), and models an equal-mass merger of two iden- 
tical massive disks, each with a gas fraction (at the time of the 
merger) of 30 — 40%, representative of observations near these 
masses terb et alj|200i) . The remnant formed in such a gas-rich 
merger is very compact (since this 30 — 40% is entirely chan- 
neled into the central starburst, the effective radius ~ 50% mass ra- 
dius - lies just outside the compact central starburst region). Since 
the halo mass has crossed the "quenching thre shold" whe re cooling 
becomes inefficient (~ IO'^Mq; see iKeres et al.ll2005l) with this 
merger, and the observations indicate that these high-redshift sys- 
tems do not form many stars from their formation redshifts to today 
(likewise massive ellipticals today have not formed significant stars 
since z ~ 2 — 3), we turn off future cooling, and evolve the system 

J iassively for a short t ime to z = 2 (to allow it to relax and redden 
Springel et alj|2005 j) ). Figure [5] (top) shows the resulting stellar 
mass surface density profile, together with some salient parameters. 
The system has a true stellar mass of M, = 1 .0 x lO" Mq , a mass- 
weighted central velocity dispersion of cr = 260km s~', a cen- 
tral/peak stellar surface mass density of S,. = 2 x 10"MQkpc~^ 
and a stellar effective radius = 1.3kpc. Note that a, Er, and Re 
are projected quantities; here, we sample the system at each time by 
projecting it along ~ 100 lines of sight, uniformly sampling the unit 



sphere, and quote the median, but the sight line-to-sightline varian ce 
in these quantities is small (< 0.1 dex; see lHopkins et aLlboOSd) . 

We then construct the light profile of the system at z~2. The 
stellar population properties (ages and metallicities) are determined 
self-consistently from the star formation and enrichment model in 
the simulation; dust properties are computed self-consistently from 
the simulation gas and sub-resolution ISM model, following the 
methodology in Hopkins et al. (2005b, a) (but at this time, the gas 
is depleted or in the hot halo, so this is a relatively small correc- 
tion). The 6-band light weighted stellar population age at this time 
is ~ 500 Myr, ver y similar to that in ferred from the observed high- 
redshift systems l lKriek et^l200 6V Figure |5] shows the true rest- 
frame 6-band light profile, and corresponding parameters. There is 
a gradient in the stellar mass-to-light ratio, owing to the recently- 
forming starburst populations; as a result, the profile shape and ef- 
fective radius are slightly different in fi-band than in stellar mass. 
The 6-band effective radius is 1 . 1 kpc; M, /L gradients in this sys- 
tem are not negligible, but contribute only a ~ 20 — 30% effect in 
the size. 

We compare this directly to the obse rved best-fit (de- 
convolved) profiles of z > 2 galaxies from Van Dokkum et all 
I2OO81) , shown over the range where the observations are well- 
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sampled and not severely affected by the PSF (roughly ~ 1 — 
4kpc). The two agree reasonably well. 

But in order to estimate the observational effects of e.g. lim- 
ited dynamic range and fitting, to compare directly to the observed 
best-fit sizes, we construct a (toy model) mock observation of the 
system. We consider an image in the rest-frame S-band, with instru- 
ment quality comparable to HST imaging, and a simple representa- 
tive psf|3 and (simple noise) sky background, and fit it in a manner 
designed to mimic the observations (here a one-dimensional fit to 
the circularized profile allowing for varying ellipticity and isopho- 
tal twists with radius). The mock "observed" best-fit Sersic param- 
eters are shown in Figure[5] The best-fit observed Re is 0.8 kpc, with 
a fitted Sersic index of ns = 3 (= 0.9 kpc if we fix iij = 4). Obser- 
vational biases from missing some of the outermost light and, more 
importantly, having the best sampling of the profile in the central 
regions where the profile shape is not the same as in the outer re- 
gions (thus leading to a fit that reflects that central portion), lead to 
a slightly smaller fitted size than the true S-band size. But the ef- 
fect is small, only 10 — 20%. Simple experiments suggest that the 
presence of such an effect depends on the exact image construction 
and fitting methodology, and a more thorough comparison of sim- 
ulated profiles and high-redshift data will be the subject of future 
work (S. Wuyts, in preparation); the important point here is that the 
effect being small at early times means that it is not especially im- 
portant whether or not such a bias exists, in explaining the overall 
size evolution of the system. 

Note that the system here is relatively round, with a median el- 
lipticity (over an ensemble of random sightlines) ofe = 1 — b/a^ 
0.2 — 0.3. Thus, if we were to adopt the circularized radius instead 
of the circular half-light radius or major axis radius, we would 
obtain a smaller radius by a factor of \fbja ~ 0.84 — 0.89. Al- 
though a small correction, this is comparable to the other effects 
here and is not negligible - even in a relatively round system viewed 
from a random viewing angle, the circularized radius yields another 
~ 10 — 20% smaller apparent size. In more flattened systems, or 
from the more extreme viewing angles for this particular simula- 
tion, the effect can be as large as ~ 30 — 40%. 

We can also consider the role of stellar mass uncertainties: 
the maximum of this effect is given by the following. We model 
the "true" stellar population parameters with the lMarastorj l l2()o3) 
stellar population models (including a large contribution to the NIR 
light from AGB) s t ars; w e then fit the mock photometry with the 
iBruzual & CharlotI ( l2003h models. If we do so, the best-fit stellar 
mass obtained from the mock image is M, = 1.2 X 1O"M0. In 
other words, the maximum of this effect is a ^ 20% bias in stellar 
mass, which given the observed correlation Re oc NpJ' , would lead 
to just a 5% effect in the apparent size-mass relation. 

As the system evolves in time, the stellar mass-to-light ra- 
tio gradients become progressively weaker (the system becomes 
uniformly old, and m oreover metallicity gra dients increasingly off- 
set age gradients; see lHopkins et al ]|2009bl) . as does the difference 
between the stellar mass inferred from different stellar population 
models, and observed at z = with the quality of the best observa- 
tions, biases in the fitted size will vanish as well. In fact, because of 
metallicity gradients (higher metallicity towards the self-enriched 
starburst material at the center) yi elding a sligh tly red c ore (typi- 
cal in z = massive ellipticals; see lPeletier et al...l990 ; Caon et al.l 



119901 ; iKuntschner et al.ll2003 ; ISanchez-Blazguez e"tai]|2007l) . the 
B-band Re here is about ~ 10% larger than the stellar mass Re at 
late times. If this is all that would happen, of course, the observed 
B-band size will simply converge or slightly over-estimate the true 
stellar-mass size; giving only a factor 1.5 — 2 evolution and yield- 
ing a system that is still much smaller than comparable ellipticals 
today. 

But, as the system ages, other effects will contribute. First, 
consider adiabatic expansion. Mod eling the stellar populations as 
with the iBruzual & CharlotI ( l2003h models, we know how much 
stellar mass loss occurs as a function of time. Since observations 
indicate the systems have evolved passively since high redshift, 
we assume that this mass lost is hea ted (easily accomplishe d by 
e.g. shocks or AGN feedback, see e.g. ICiotti & OstrikeilboOTh and 
contri butes to the hot, virialized halo in hydrostatic equilibrium 
jCox et al. 2006a). Since all that matters for the size of the system is 
that most of the gas not be in the compact central regions, it makes 
no difference whether we assume this model or that the gas is ex- 
pelled completely. Likewise, it makes no difference if we assume 
all the remaining gas at the end of the merger is rapidly expelled 
(most of the gas has, at that point, been depleted by star formation; 
the much larger gas reservoir for adiabatic expansion comes from 
stellar mass loss). At each time, we then adi abatically expand th e 
system according to the mass lost following lOnedin et al.l ( |2004|) . 
Although, in principle, up to ~ 50% of the stellar mass is recycled 
by stellar mass loss, much of this occurs when the stellar popula- 
tions are very young (to rough approxi mation, mass loss rates a t 
late times decline as ~ r^'). Given the IBruzual & CharlotI l l2003h 
models and the distribution of stellar population ages already in 
place at z = 2, the stars here will lose only ~ 20% of their mass 
over the remainder of a Hubble time. At z = then, the net effect 
of adiabatic expansion (alone) would be to leave the profile shape 
the same, reduce the stellar mass to 0.8 x lO" Mq, inflate the ef- 
fective radius to 1.5 kpc, and decrease the velocity dispersion and 
to cr = 200kms^' (note that a has contributions from dark mat- 
ter and large radii, so is not quite as strongly affected as might be 
expected). 

The system experiences a significant merger history, involving 
several different mergers. Here, we selec t a representati ve merger 
history from the cosmological models in lHopkins et al.1 (|2009gfl 
where these are discussed in detail, for a galaxy that is already a 
massive ~ lO" Mq at z > 2. We simulate that merger history at 
high-resolution, and show the results in Figure|5] 

For simplicity, and to correspond to the model classification 
scheme used throughout this paper, we divide the mergers in the 
cosmological model into "identical dry mergers" and "minor/late 
mergers", and consider each in turn (note that, for the final rem- 
nant, the exact time-ordering of the mergers does not make a signif- 
icant difference). First, consider the "identical dry merger": around 
z ~ 2, the system experiences a (marginally) major merger (mass 
ratio ~1:3) with another spheroid that was, itself, also formed in a 
~ 30 — 40% gas merger (again, since this is still at high redshift, 
these numbers are typical; more gas-poor systems are not typi- 
cal), and should therefore be similarly dense. We model this by 
considering a 1:3 merger with a similar spheroid formed, itself, in 
an equally gas-rich merger. The effective radius expands Re oc M, , 



* For the sake of generality, we adopt a simple Gaussian PSF with 1 a width 
= 0.5 kpc (FWHM= 1.2 kpc), representative of the best-case resolution of 
most HST observations at z > 0.5. 



The merger rates from this model can be obtained as 
a function of galaxy mass, redshift, and merger mass ratio 
from the "merger rate calculator" script publicly available at 
,http : / / www . cf a . harvard . edu /~phopk ins/ Site/mergerca'lc . html| 
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as expected, and the velocity dispersion increases very slighitly (it 
is not perfectly constant because there is some preferential trans- 
fer of energy to the least-bound outer material instead of the cen- 
tral regions). The net effect is a relatively small increase in size 
(Re — 1.9kpc) and little growth relative to the size-mass relation. 

However, at the same time and, increasingly, at later times, 
the system experiences a number of minor and major "late" (less 
dissipational or less dense) mergers. Specifically, in this system, 
we consider just those above a mass ratio 1:10 (more minor merg- 
ers contribute a negligible ~ 5% to the final mass). In general, 
iHopkins et al.l ( 2009g) show that > L, systems have enhanced 
merger rates, but the mergers become progressively more minor 
(on average) as the system grows in mass. This simply reflects the 
fact that most of the mass in the Universe is in ~ L* systems, so 
the growth will be dominated by mergers of those systems (which 
are minor when the system is 3> L,). In the particular Monte Carlo 
history modeled here, this is manifest in the series of "minor/late 
mergers." 

The system, around z ~ 0.5 — 1, experiences its last major 
merger, with a more gas-poor disk of mass ^ 0.5 L, (a roughly 
1:3 merger). The disk has a gas fraction of ~ 20% at the time 
of merger - still not negligible, but significantly lower than the 
~ 40% that made the original spheroid. Therefore it contributes 
proportionally more dissipationless (stellar disk) material, which 
is low-density relative to the dissipational (gas/starburst) material, 
and therefore preferentially builds up the "wings" of the profile. 
Similarly, this is accompanied and followed by a sequence of a 
~1:5 merger, a ~1:6 merger, and a ~1:10 merger, spanning red- 
shifts z ~ — 1.5. Whether those secondary galaxies are modeled 
as disks or spheroids makes little difference - the key is that they 
have the appropriate gas or dissipational fractions for their mass 
and redshift (~ 10 — 20%). In fact, we also find that the precise time 
ordering or even the mass ratios of the mergers is not a significant 
source of uncertainty in the predicted mass profile. To lowest or- 
der, the important quantities are simply the total mass added to the 
"dissipationless" low-density envelope (i.e. material contributed by 
lower-redshift, low-density stellar disks, whether or not these come 
in minor units or are pre-processed into spheroids) versus the mass 
added to the "dissipational" starbu rst component (for a detailed dis- 
cussion, see lHopkins et alj|2009el l3). We appropriately account for 
adiabatic mass loss in each galaxy as described above, and con- 
struct a mock image of the final remnant as observed at z = 0. 

Allowing for this merger and accretion history, the system 
has grown by a moderate (albeit non-trivial) factor ~ 2.5 in stellar 
mass. This is small enough that there is no conflict with observed 
stellar mass function constraints: even if every such compact z > 2 
massive (M* > lO" Mq) passive galaxy grew by such a factor, it 
would account for only ~ 20 - 50% of the z = M. > 2 - 3 x 
lO" Mfi^ spheroid population (see e.g. Perez-Gonzalez et al]|2008l : 
iMarchesini et alj|2008l : Ivan der Wei et al.ll2009l) . The velocitv dis- 
persion grows by a small amount for the same reasons as in the 
"identical dry merger", but is still moderate for the total stellar 
mass, a ~ 240km s~'. But the effective radius has grown by a fac- 
tor ~ 6 — 10. The stellar mass and 6-band effective radii are now 
~ 7 kpc, and the fitted Re is slightly larger, owing to the large best- 
fit Sersic index ~ 7 (which itself owes to the extended envelope 
of low-density material acquired via these late/minor mergers). The 
system is also extremely round at this point - there is only a ~ 5% 
difference between the circular half-light radius and the "circular- 
ized" radius. 

We compare the predicted final profile to the observed 
light profiles of the ~ 10 most massive Virgo ellipticals (from 



iKormendv e"tai]|2009l) . spanning a mass range ~ lO'"^ " 'Mq. 
The profile appears quite typical. Note that although the central 
density inside ~ lOOpc may be slightly high, by a factor ~ 1.5 — 2, 
this is still well inside the scatter in central profile shap es typical in 
larger samples of ellipticals (see e.g. lLauer et alj2007d) . Moreover, 
our simulations do not include any process of core "scouring," by 
which BH-BH mergers are expected to eject stars inside precisely 
these radii (~ 10 — lOOpc, for galaxies with the masses of our final 
remnant) and convert initially "cuspy" nuclear profiles (formed in 
gas-rich starbursts as those here) into " cored" profiles. Including a 
toy model for such a process following lMilosavlievic et al] ( l2002h . 
we find that the predicted profile is fairly typical. 

Therefore, allowing for the combination of all the effects con- 
sidered here, we find it is possible to account for even order-of- 
magnitude size evolution in the most massive, early-forming com- 
pact spheroids. The most important contribution to that evolution is 
the "minor/late merger" channel, as expected based on our com- 
parisons in § [3] But as noted in § O given cosmologically re- 
alistic merger histories and observational constraints on galaxy 
merger/growth rates, this channel is only expected to contribute a 
factor ~ 3 — 4 in size growth. The remaining factor ~ 2 — 3, we 
find, comes from a combination of the other effects considered 
here: gradients, observational effects, adiabatic expansion, 

and identical dry mergers. Each of these effects contributes only 
a relatively small ~ 10— 30% effect to the size evolution - but to- 
gether, this yields the net factor ~ 2 — 3 additional evolution needed 
to reconcile the observations at z > 2 and z = 0. 

Given the combination of these effects, we can now attempt to 
predict the evolution of properties at fixed mass, as in Figure|4l but 
for realistic systems where all of these effects act at different levels. 
We use the Monte Carlo merger populations from Hopkins e t al] 
( l2009gh . together with the simple analytic fitting functions for how 
sizes and velocity dispersions change in mergers of a given mass ra- 
tio, si ze ratio, and gas fraction, from lHopkins et al] j2009el . l2008al . 
|2009d), and we model the effects of adiabatic expansion in each 
system as described above. The magnitude of observational bias as 
a function of r edshift we estimate fol lowing the (very simplified) 
methodology inl Hopkins et al. Il l2009ah . and mass-to-light ratio gra- 
dients are modeled as a function of time by using the mean scalings 
of stellar population gradien t strength with post-me rger gas fraction 
and age from simulations in lHopkins et al.l ( l2009bt) . The details are 
not particularly important; we could assume all these effects scale 
with time and redshift exactly as in the specific experiment shown 
in Figure|5]and we would obtain a similar answer. 

Figure |6| shows the results. We plot how the median size, ve- 
locity dispersion, central stellar mass surface density, and best-fit 
Sersic index scale with redshift at fixed mass, given this combi- 
nation of effects. In each case they agree well with the observa- 
tions, and are similar to the "minor/late merger" track in Figure|4] 
as expected. We also show the contribution to each effect from each 
of the different channels modeled. As before, although late/minor 
mergers dominate, the other effects together contribute compara- 
bly, and no single one of those effects is especially more important 
than the others. 



6 IMPLICATIONS FOR THE EVOLUTION IN BH-HOST 
GALAXY CORRELATIONS 

Given these models, we briefly consider their implications for the 
evolution in the correlations between black hole (BH) mass and 
various host galaxy properties. 
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Figure 7. Predictions for the evolution in BH mass at fixed host galaxy stel- 
lar mass, in the style of Figure|4](?op) and Figure|6](feo??om), for the same 
models. Observational effects. M, /L gradients, and identical dry mergers 
all give no evolution; adiabatic expansion and M» uncertainties lead to 
higher (real or infen'ed) bulge masses at high-z, hence lower A/BH/Mbulge- 
Accounting for subsequent BH accretion only makes the predicted evolu- 
tion more negative. Minor/late mergers allow for evolution towards larger 
A^BH/^buloc at high-redshift; the bulge "catches up" via later mergers with 
e.g. gas-poor disks. We show both the upper limit of such evolution (as- 
suming that the late/minor mergers contribute no BH mass) and the cosmo- 
logically expected evolution (given BH masses and merger histories from 
hydrodynamic simulations). Bottom: Same, from the full model of §[5] with 
contributions from each effect. Observ ational con s traints are compared, 
from H aring & Rix (2004, bl ack st ar), IPeng et al] j20 0^ blue squares) 



IWoo et al.l (20061) ; iTreu et all i2007 '. green triangle). andl Salviander et alj 
i2007L orange inverted triangles). Upper limits derived from observations of 
the spheroid mass density a nd the fact that BH m ass cannot decrease with 
time are also shown, from IHopkins et al] jlOOd red circles). The model 
agrees reasonably well, but the observations are systematically uncertain, 
and various biases probably affect the determinations at high redshifts (see 
text). 



In the models which do not involve mergers, considered 
here, the expected evolution of BH-host correlations is particu- 
larly simple. We perform the same exercise as in § [S] and re- 
quire that all the models reproduce the observed z = BH-host 
relations. Specifically, we enforce the z = relatio n between BH 
mass and bulge stellar mass (Mbh = 0.0014M.; Haring & Rijj 
[2004.) . although adopt instead the z = Mbh — o relation (Mbh = 



10*^ " (cr /200km s~")'^°^ pTremaine et ai]|2002h makes no differ- 
ence. Since the BH cannot decrease in mass with time, this sets at 
least a limit on the evolution. We evolve the systems backwards in 
time, and predict the observed BH mass at fixed galaxy properties. 

In the case of stellar mass-to-light (M* /L) ratio gradients and 
observational/seeing effects, the prediction is trivial. Neither of 
these effects has any affect on either the galaxy stellar mass or the 
BH mass - systems simply evolve passively and appear to change 
in radius. 

The case of stellar mass uncertainties is only slightly more 
complex. Here, there is no effect that can change the BH mass; but 
at high redshifts, the system appears to be higher-stellar mass, by 
an amount tuned to match the apparent evolution in the size-mass 
relation. Therefore, for an (apparent) mass-selected sample at high 
redshift, since the "true" stellar mass is lower, the BH mass must 
be lower, and there will appear to be strong negative evolution in 
Mbh- 

Adiabatic expansion requires significant host bulge mass loss 
to have occurred since the time of BH and bulge formation, so it 
predicts that BH masses at high redshifts should be substantially 
smaller than those today (at fixed stellar mass; such that after this 
host mass loss they will lie on the z = Q BH-host relation). In the 
prediction shown in Figure[7](top), we assume that the BH does not 
grow in mass over this time (i.e. the entire z = BH mass is in place 
at the time of formation). It is possible that, instead, some small 
fraction of this material from adiabatic mass loss is consumed by 
the BH, increasing its mass. However, this means that the BH mass 
at the time of formation must be lower; therefore, the evolution in 
Mbh at fixed bulge mass would be even more negative (towards 
lower Mbh(z)/Mbh(0)). Likewise for the other models above, if 
further accretion occurs subsequent to the initial bulge formation. 

The case of identical dry mergers is also straightforward. Un- 
like the cases above, the BH will grow in time - with each dry 
merger, the BH should grow via the merger of the two progenitor 
BHs. But since the systems are, by definition, structurally identical, 
they should obey the same proportionality between BH mass and 
host galaxy stellar mass. So BH mass and stellar mass simply add 
linearly, and the BH-host mass relation (being linear itself) is con- 
served by these mergers. As a consequence, there is no predicted 
evolution in Mbh at fixed stellar mass (i.e. MBu/Mbuigc), although 
Mbh and Mbuigc can both grow significantly for an individual sys- 
tem. 

The minor/late merger case is more interesting. As before, 
BH mass will grow via mergers. However, unlike the identical dry 
merger case, it is not necessarily true that the BH masses of both 
merging systems obey the same BH-host galaxy correlations (and 
therefore that BH and host mass will add linearly and conserve the 
Mbh — Mbuigc relation). The previous scale-invariance can be broken 
in two ways. First, imagine the case of a merger with a late-forming, 
gas-poor disk-dominated galaxy (for the case of simplicity, take 
the extreme limit of a gas-free, nearly pure disk secondary). Since 
the secondary has little or no gas, there will be no new accretion; 
since it has little or no bulge, it has corresponding little or no BH. 
As a consequence, the secondary merger contributes negligible BH 
mass. But the entire secondary disk stellar mass will be violently 
relaxed, adding substantially to the total bulge mass. Similarly, 
spheroids and disks that merge later will have formed later, from 
more gas-poor mergers. Such mergers build up a less-deep central 
potential and fuel less material to the BH, and so will form less 
massiv e BHs, relative to their bulge masses (see e.g. lHopkins et al.l 
l2007ah . The result in either case is that Mbh at high redshifts should 
be larger at fixed stellar mass. 
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An upper limit to the magnitude of tliis effect is easy to de- 
termine. In this case, assume that no BH mass is contributed from 
the subsequent late/minor mergers. This could be the case if e.g. 
the depletion of gas fractions occurs sufficiently rapidly with cos- 
mic time, or if the late/minor mergers involve preferentially very 
disk-dominated galaxies, or if BH-BH mergers are inefficient and 
the systems are relatively gas-poor. Because the BH mass still can- 
not decrease, and must lie on the observed z = correlations, this 
amounts to assuming the entire z = BH mass is in place at the 
time of formation, with only bulge mass added since. The result- 
ing prediction is up to a factor ~ 3 increase in MBn/Mbuigc from 
z = 0-2. 

A more realistic prediction requires some model for how mas- 
sive a BH will be as a function of bulge properties at formation 
(e.g. gas richness, velocity dispersion, mass, etc.), to determine how 
much BH mass is contributed by t he late/secon dary mergers. One 
such model is present ed in detail in Hopkins et al. ( 2009b); similar 
cases are discussed in iHopkins et al.l j2007 a') and Crotoa ilOO^ . It 
is not our intention here to develop those models (their predictions 
for the evolution in BH-host correlations are discussed in much 
greater detail therein); but we briefly summarize the salient points 
and show the results. 

As in most models, BH growth is regulated by some form 
of feedback, from energy or momentum associated with the ac- 
cretion luminosity. Various works have shown that this can natu- 
rally account for the observed correlations between BH mass and 
spheroid velocity d i spersion, mass, and ot h er properties (see e.g. 



Silk & ReesI Im^. JPi Matteo et alj |2005| ; iHopkins & Hemguistl 
200d;'Hopkins et alj2007^ Generically, in such models, the "most 
fundamental" correlation is between BH mass and some measure of 
the binding energ y of the material which must be "halted" to arrest 
further accretion; IHopkins et al. I j2007ah show, in high-resolution 
hydrodynamic simulations, that this can be approximated by a 
"true" fundamental plane-like correlation between BH mass and 
the combination of stellar mass and a, Mbh oc m1'^ a^ -, at the time 
of bulge formation in gas rich mergers. Direct analysis of the ob- 
served BH-host correlations app ear to s upport such a driving cor - 
relation (s ee e.g. [aj pkins et al1 l2007bl ; lAller & Richstonell2007h . 
Moreover, Younger etaL) ( 12008 a*) showed that such a model pre- 
dicts a unique difference between e.g. the Mbh — o" relation of 
"classical" bulges (believed to be formed in mergers) and "pseu- 
dobulges" (formed in secular events, from e.g. disk bars), because 
the observed velocity dispersion relates differently to the central 
binding energy in structurally distinct objects. Since then, vari- 
ous observations have found that such a difference exists, with 
the sense and magnitude pr edicted IHuI|2008| ; iGreene et al.ll2008l ; 
iGadotti & Kauffmann|[2003) . In any case given such a model for 
the BH mass as a function of bulge properties (at e.g. fixed mass 
or cr), it is straightforward to calculate the contribution from sub- 
sequent minor/late merger s (which we do followi ng, in detail, the 
equations and approach in IHopkins et al] j2009bl) ) and (given the 
z = constraint) predicted evolution in Figure|7] 

Essentially, the result is similar, but with somewhat weaker 
evolution than the upper limit case (some non-trivial BH mass is 
contributed by these subsequent mergers). The net evolution, as dis- 
cussed in Hopkins et al. (2007a) and Hopkins et al. (2009^) owes 
to the fact that because high redshift mergers are more gas rich and 
yield more compact remnants, they have higher central binding en- 
ergies than low-redshift counterparts (for the same stellar mass) and 
therefore will produce higher-mass BHs. The bulge mass "catches 
up" via accretion of less dense, later-forming systems with lower 



B H mass for the ir bulge, and (in some cases) less bulge, as outlined 
in lCrotoiJ (l200d). 

We compare these predictions to various observational es- 
timates of the evolution. Specifically, we consider observations 
from z = - 1 from SDSS AGN samples dSalviander eT^bOOd 
l2007h . from z = 1 — 3 from lensed quasar hosts jPeng et alj|2006h. 
and at z = 0.36 from narrow-redshift Seyfert samples dWoo et al] 
l2006l ;l Treu et al.[ 2007). We caution, however, that these estimates 
(indirect BH mass estimates based on the virial mass indica- 
tors) are systematically uncertain, and various biases are present 
in AGN-selected samples that will tend to systemat ically over- 
estimate the amount of evolution at high redshift (see lLauer et al.l 
[2007a) (although the lSalviander et al. 2007. estimates do att empt 
to correct for this bias). Alternatively, IHopkins et al. I ( l2006l) de- 
termine a non-parametric upper limit to the degree of evolution 
by comparing the observed spheroid mass density at each red- 
shift to the BH mass density at z = 0, given the constraint that 
BHs cannot decrease in mass; we show this as well. Other in- 
direct constraints, from e.g. clustering and integral constraints, 
give qualitatively similar results, but with a large scatt er (see 
e.g. [M erloni et al. 2004; McLure & Dunlop 2004; Alexan der et all 
l2005l [20081 ; lAdelberger & Steidell |2005| ; iFine et alj 1200^ . Al- 
though the various biases and uncertainties involved are large, it 
appears robust that there is probably some evolution towards higher 
Mbh at fixed galaxy stellar mass. We hesitate to use this as a quan- 
titative constraint on the models here, but note that this appears to 
strongly conflict with the adiabatic expansion and stellar mass error 
models, and moreover stress that only the late/minor merger chan- 
nel provides a mean to evolve in the observed sense, at any level. 



7 DISCUSSION 

We have illustrated how different physical effects combine to ex- 
plain the observationally inferred size evolution of massive ellipti- 
cals with redshift, and shown how various observations can distin- 
guish between these models. 

Observations have shown that, at each time, most of the 
spheroid population is recently assembled. As a consequence, the 
evolution in the size-mass relation as a whole must reflect a red- 
shift dependent scaling in the in-situ sizes of ellipticals at the time 
of fo rmation. As is discus sed in detail in iKhochfar & Silld ( |2006|) 
and Hopkins et ^ i2009d'), this occurs naturally in merger mod- 
els: at the same mass, higher-redshift disks have larger observed 
gas fractions. The gas fraction at the time of merger, which yields 
the compact, remnant starburst stellar populations required to ex- 
plain elliptical stellar population gradients, kinematics, mass pro- 
file shapes, and densities, is the dominant deter minant of the size 
of any merger remnant (see e.g. [Hopkins et ■d]i2008a . and refer- 
ences therein). As a consequence, mergers of these gas-rich disks 
at high redshifts will yield spheroids with smaller effective radii. 
We show that simulated merger remnants with the appropriate gas 
fractions as a function of redshift naturally reproduce the observed 
evolution in the mean trends. 

That being said, there does not appear to be a " relic" popula- 
tion of high-mass, very compact spheroids (see e.g. iTruiillo et al.l 
[2009). It may even be the case that the most highly clustered (and 
therefore earliest-to-assemble) spheroids (i.e. BCGs) may have 
somewhat larger sizes for their mass. Therefore, some other pro- 
cess must increase the apparent sizes of these systems after their 
formation, at least keeping pace with the evolution in the median 
population owing to redshift-dependent gas fractions. 
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We discuss six effects that can contribute to sucii evolution, 
tiiat cover the range of possibilities in the literature. These include: 
"identical" dry mergers (mergers of identically compact spheroids), 
"late/minor accretion (major or minor mergers at later times, with 
less dense material - for example, less gas-rich disks or other 
spheroids formed in lower-redshift, less gas-rich mergers), adia- 
batic expansion (owing to mass expulsion from feedback or grad- 
ual stellar mass loss from stellar population evolution), the pres- 
ence and subsequent evolution of stellar mass-to-light ratio gradi- 
ents (changing e.g. the observed optical or near-IR Rc relative to 
the stellar mass Re), age or redshift-dependent uncertainties in stel- 
lar masses (owing to e.g. unaccounted-for contributions of AGN 
star light in young stellar populations), and seeing/observational 
effects (e.g. possible redshift-dependent fitting biases from limited 
dynamic range and resolution effects, or issues arising from differ- 
ent choices of definition of effective radii and different distributions 
of galaxy shapes in different samples). 

In each case, we show how these different mechanisms should 
move systems through the size-mass space with redshift, and 
how this relates to other observable properties, including spheroid 
velocity dispersions, central stellar mass densities, light profile 
shapes, and black hole-host galaxy correlations. 

Given this, we construct the predictions from each model, if 
it were solely responsible for the observed evolution. In particular, 
we demand that each model obey the observational constraints as 
boundary conditions: newly-formed ellipticals must appear to have 
the appropriate observed size at each redshift, and then, evolved 
forward allowing only the given affect to alter the size, must lie on 
the appropriate parameter correlations of z = galaxies (i.e. have 
velocity dispersions, densities, and profile shapes at z = within 
the range observed for systems of the remnant final mass). This al- 
lows us to construct the corresponding predictions from each model 
in these other observable properties, and in particular in how these 
properties should scale with fixed mass as a function of redshift. 

Comparing these to recent observational constraints, we can 
clearly rule out several of the above models as a dominant driver 
of size evolution. The only model that is consist ent with all of 
these c onstraints is the "late/minor merger" model. iHopkins et al.l 
( l2009al) have shown, for example, that the observed profiles of 
the high-redshift systems are actually very similar to the observed 
cores of massive ellipticals tod ay - they are not mor e dense, by 
even a factor ~a few (see also iBezanson et al]|2009l who reach 
similar conclusions). It appears that the dominant effect has been 
the buildup of extended, lower-density wings, from high redshifts 
to today, yielding higher Sersic indices in low-redshift systems and 
more extended envelopes that drive the effective radii to larger val- 
ues, while leaving the central propert ies relatively int act. Support- 
ing evidence for this comes form Cenarro &. Trujillo (2009), who 
see only weak growth in the velocity dispersions of galaxies at 
fixed mass with redshift, consistent with evolution via buildup of 
low-density material that does not strongly affect a. In contrast, 
an adiabatic contraction or identical dry merger model would pre- 
dict dramatically higher central densities at high redshifts, with cor- 
respondingly larger velocity dispersions. Invoking stellar mass bi- 
ases would predict dramatic inverse evolution in velocity disper- 
sions with redshift. And invoking mass-to-light ratio gradients pre- 
dicts weaker velocity dispersion evolution, and opposite light pro- 
file shape evolution, relative to that observed. 

However, although it is in principle possible to explain the en- 
tire necessary size evolution via "late/minor mergers" without vio- 
lating these observational constraints, both theoretical models and 
independent constraints on e.g. merger rates and the stellar mass 



function suggest that the required number of mergers would be too 
large, at least in the extreme case of factor ~ 6 — 10 evolution in 
sizes of the most massive, early-forming spheroids as required. A 
priori theoretical models and semi-empirical models based on ob- 
served clustering and stellar mass function evolution predict closer 
to a factor ~ 3 — 4 size evolution via this channel, given the ob- 
served/expected merger rates. 

We therefore consider a high-resolution case study, using hy- 
drodynamic merger simulations, of a typical history of such an 
early-forming galaxy in one such cosmological model. We show 
that, as required by the observations, the minor/late merger chan- 
nel does dominate the size evolution, yielding a factor ~ 3 — 4 size 
evolution with a moderate (factor ~ 1.5 — 2) stellar mass growth. 
However, the other effects described here also all contribute some 
apparent or real size evolution. Each of the them contributes a rel- 
atively small effect, ^ 20 — 30%. Mass-to-light ratio gradients are 
inevitably present at all times, contributing a net effect at this level 
via slight bias towards smaller S-band Re at early times and larger 
Re at late times, but each at the ~ 10% level. Observational fit- 
ting/resolution effects, and/or small shifts in the typical flattening 
of systems in observed samples coupled to the adoption of a "circu- 
larized" effective radius definition, can contribute a relatively small 
~ 20% to size estimates, for the observed and model profile shapes. 
Given the stellar ages, star formation rates, and masses already in 
place at high redshift, and low-redshift constraints on star formation 
histories of massive ellipticals, there could not be a very large gas 
supply awaiting removal in the compact passive systems, but they 
will inevitably lose ^ 20% of their mass to later stellar evolution, 
and will adiabatically expand as a result. Also, given the observed 
photometry, stellar mass estimates appear uncertain at less than the 
~ 20% level. And the earliest (highest redshift) mergers after the 
system first forms are likely to be similarly gas-rich and/or com- 
pact, contributing an average of ~ 20% in mass via the "identical" 
dry (or mixed) mergers channel. 

Each of these effects is small; together, however, they con- 
tribute a net additional factor ~ 2 — 3 size evolution. This is the 
remaining factor needed to explain the observed trends. The pre- 
dictions for other quantities such as velocity dispersions, central 
densities, and profile shapes, in such a realistic model, are still very 
similar to the minor/late merger track predictions (not surprising, 
given that this effect still dominates), and as such consistent with 
observations. But with a net mass evolution of just a factor ~ 2 — 3, 
consistent with merger rates and stellar mass function evolution, 
the system will expand by nearly an order of magnitude in appar- 
ent Re. The combination of secondary effects - all similarly impor- 
tant - explains the remaining factor needed to reconcile models of 
e.g. mergers alone and the observations. Moreover, because each 
of these effects is, individually, small, the success of theoretical 
models does not critically depend on any one of the effects operat- 
ing (other than, of course, late/minor mergers). The size evolution 
of even the most massive, early forming spheroids, therefore, ap- 
pears to be a natural consequence of hierarchical, merger-driven 
spheroid formation; but understanding this evolution in detail re- 
quires accounting for a number of effects related to e.g. stellar pop- 
ulations, small deviations in profile shape, multi-component stellar 
mass profiles, and different mergers that can only be followed in 
high-resolution simulations. 

Interestingly, it also appears that only the "late/minor merger" 
channel provides a means for evolution in the BH-host galaxy cor- 
relations, in the sense of more massive BHs at high redshift rel- 
ative to their host galaxies. The other mechanisms predict either 
no or inverse evolution in these correlations, which both appear 
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to conflict with the observations and would make reconciling the 
BH population and high-redshift quasar population (which requires 
a large fraction of the presen t-day massive BH pop ulation be ex- 
tant and active at these times jHopkins et a"Lll2007ch ) difficult. The 
full model above appears to be consistent with recent observations 
of such evolution, but observational uncertainties remain large. In 
this scenario, early-forming BHs are relatively massive, owing to 
the deep potential wells of their compact hosts; the spheroid mass 
can then "catch up" to the z = relation observed via accretion of 
lower density material - e.g. gas-poor disks or spheroids formed 
from mergers of such disks, with rel atively small pre-exist ing BH 
masses. For more details, we refer to lHopkins etal] j2009bh . 

Further observational tests will present valuable constraints 
of these models. We have outlined predictions for a number of 
basic properties, for which observations have only just become 
possible at intermediate redshifts. Better observations as a func- 
tion of mass, redshift, and galaxy morphology remain critical. 
The present velocity dispersion, central stellar density, and pro- 
file shape information is limited to samples of a few objects and, 
often, stacked images. Direct construction of surface density pro- 
files, in order to measure the mass at low stellar densities as a 
function of redshift, can map out the effects of less dense later 
mergers. Imaging in multiple bands (yielding e.g. color gradients) 
can greatly constrain the role of stellar mass-to-light ratio gradi- 
ents and stellar mass biases. Better constraints on stellar popu- 
lations at low redshift - in particular stellar population age gra- 
dients - will constrain how efficient stellar mass loss must be, 
and correspondingly the role of adiabatic expansion. Kinematics 
of galaxies at all redshifts represent a powerful constraint both 
on the original, gas-rich merger history (allowing direct tests at 
high redshift of the hypothesis, which appears successful at low 
redshift, that gas-richness dominates the determination of the rem- 
nant size) and on the subsequent merger history, potentially map- 
ping between ga s-rich or equal density mergers and dry, lo wer- 
densi t y mergers (iMihos & Hernquisl 1994a; Barnes & Hernguisj 
199^; Naab et al."2006a; Cox et al. 2006b; Robertson et al. 2006a; 
Oiiorbe et al. 2006 ; Jesseit etal. 2007 ; Hopkins et aU. 2009b, e). 



We wish to emphasize that the various observations dis- 
cussed here, and the magnitude or nature of size evolution, do 
not themselves strongly constrain whether or not the "late/minor 
accretion" channel is actually dominated by a few "major" or a 
larger number of "minor" mergers. Provided that the same to- 
tal amount of low-density material (stellar mass) is added to a 
system, in a dissipationless/collisionless fashion, it makes no dif- 
ference whether or not it is "brought in" by a major or minor 
merger. In other words, so long as the major companions are suf- 
ficiently low-density (unlike in the "identical dry mergers" case, 
where they are, by definition, high-density), the characteristic stel- 
lar densities and radii of that material in the post-merger rem- 
nant will be the same as if the material was merged via a se- 
ries of much more minor mergers. T his follows necessarily from 
basic phase-space co nsiderations (e.g. iGallagher & Ostrikerlll972l ; 
iHemguist et al.l I993i). and has been seen as well in various nu- 



merical studie s < Boumaud et al.ll2003;lBovlan-K olchin et al J2006I ; 
iHopkins et aTll2009el;lNaab et al.ll2009( ; lFeldmann et al.ll2009l) . Re- 



alistic cosmological models predict that both major and minor 
mergers contribut e comparably to this a ggregation of low-density 
material (see e.g. iHopkins et al. I l2009g| . and references therein), 
with major mergers dominating at masses near and somewhat 
above ~ L,, and minor mergers becoming progressively more im- 
portant at higher masses (essentially, most of the mass is near ~ L* ; 



so a "typical" z = 0, lOL, system will reflect buildup from ~1: 10 
mergers). 

There will of course be considerable galaxy-to-galaxy vari- 
ation in merger histories, and some fraction ^ 10% of sy stems 
may escape much subsequent merging l lHopkins et alj2009(j ). sug- 
gestively similar to the recently-observed abundance of popula- 
tions of present-day compact sy stems in gal axy clusters, their ex- 
pected environments ( Valentin uzzi et al ]|2009;) . Distinguishing ob- 
servationally between e.g. a series of minor mergers and a couple 
of major mergers will require additional, independent checks. Ob- 
viously, direct constraints on merger rates/fractions as a function 
of mass, redshift, and mass ratio are important. Also, higher-order 
kinematics, such as e.g. galaxy isotropies and the presence/absence 
of certain kinematic subsystems can distingui sh between different 
merger histories (see references above and e.g. lBurkert et alj2008l ; 
Hoffman et al. 2009). These constraints will be particularly valu- 
able in informing theoretical models that attempt to follow the de- 
tailed formation histories of such systems. 

However, it should also be emphasized, as above, that the vast 
majority of ellipticals/spheroids do not form via these high redshift, 
compact channels. The observed mass density of bulge-dominated 
galaxies at z ~ 2 is only ~ 5% of its z = value (and this is 
probably an upper limit, given the expected contamination in how 
such systems are selected; see e.g. iDaddi et al.|[2005l ; iLabbe etd] 
I2OO5I ; (Orazian et al]|2007l) ; by z = I it has inc reased but is still 
only ~ 20 - 35% o f its z = value jBundv et al. 2005,, |2006|; 
[Abraham et alj|2007l) . So most bulges are formed at relatively late 
times, where they are not observed to be especially compact, and 
require no subsequent merging or other activity to change their pro- 
file shapes/sizes. This is particularly important for the "cusp" el- 
liptical population, of rapidly rotating, diskier ellipticals and SO's, 
which are believed to be the direct products of gas-rich mergers 
(and carmot "tolerate" much subsequent dry merging without dis- 
rupting these properties; se e e.g. Naab et al. 2006b; Iciotti et af] 
I2OO7I; [Hopkins et al.l 120093 ; ICox et all 12009 ); these dominate the 
buIge/spheroid/eUiptical mass density, and dominate the popula- 
tion by number at all but the highest masses > lO" Mq , the regime 
where "core" ellipticals (believed to have survived dry mergers) be- 
come important, in good agreement with the models for evolution 
considered here. 
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